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Review 

Potential of a mutant-based reverse genetic approach for functional genomics and 
molecular breeding in soybean 

Toyoaki Anai* 

Laboratory of Plant Genetics and Breeding, Faculty of Agriculture, Saga University, Honjyo-machi 1, Saga 840-8502, Japan 

Mutant-based reverse genetics offers a powerful way to create novel mutant alleles at a selected locus. This 
approach makes it possible to directly identify plants that carry a specific modified gene from the nucleotide 
sequence data. Soybean [Glycine max (L.) Merr.] has a highly redundant paleopolyploid genome (approx. 
1.1 Gb), which was completely sequenced in 2010. Using reverse genetics to support functional genomics 
studies designed to predict gene function would accelerate post-genomics research in soybean. Furthermore, 
the novel mutant alleles created by this approach would be useful genetic resources for improving various 
traits in soybean. A reverse genetic screening platform in soybean has been developed that combines more 
than 40,000 mutant lines with a high-throughput method. Targeting Local Lesions IN Genome (TILLING). 
In this review, the mutant-based reverse genetic approach based on this platform is described, and the likely 
evolution of this approach in the near fiiture. 
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Introduction 

Soybean {Glycine max (L.) Merr.) is one of the most impor- 
tant leguminous crops. As a legume it has symbiotic nitro- 
gen fixation ability, and is a major global source of proteins 
and oil for human and animal consumption. The genome se- 
quence of soybean (cv. Williams 82) was completed using a 
whole-genome shotgun method in 2010 (Schmutz et al. 
2010). Soybean's paleopolyploid genome, which is approx- 
imately 1.1 Gb long, comprises -46,000 predicted genes, of 
which nearly 75% are multiple copies. The integrated soy- 
bean genome database Soybase (http://www.soybase.org) 
provides public access to the nucleotide sequences and other 
useful information on soybean (e.g., mapped genes, 
orthologs and markers). The genome sequence information 
is useful for traditional (forward genetics) analysis. For ex- 
ample, Watanabe et al. (2009, 2011) reported map-based 
cloning of two genes for maturity quantitative trait loci using 
a residual heterozygous line. 

On the other hand, reverse genetic tools and resources 
have proven to be indispensable for discovering the biologi- 
cal fiinctions of sequenced genes and for developing novel 
alleles for specific traits. The soybean genome sequence is 
an essential tool for accelerating reverse genetic studies. Be- 
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cause the methods of reverse genetics were developed main- 
ly in studies of Arab idops is, many are based on transgenic 
technology (Table 1). However, transgenic technology is not 
a realistic approach for commercial crops such as soybean, 
because developing and maintaining a huge number of trans- 
genic plants is a time-consuming and labor-intensive pro- 
cess. In contrast, mutant-based reverse genetic methods 
provide a more practical and potent alternative. There are 
several major benefits of using a mutant population for re- 
verse genetic analysis: (1) The mutagenesis protocols are 
well established; (2) A large nimiber of independent mutants 
can be obtained simultaneously; (3) There is no legal restric- 
tions on the handling and transfer of the mutants, unlike with 
transgenic plants. Dozens of reverse genetic methods for 
screening mutant populations have been developed around 
different mutagens and mutant types. These include the 
Deletagene (Delete-a-gene) method (Li et al. 2001), high- 
resolution melting (HRM; Gady et al. 2009), comparative 
genomic hybridization (CGH; Bolon et al. 2011), and Tar- 
geting Local Lesions IN Genomes (TILLING; McCallum et 
al. 2000). 

This review provides a brief overview of recent advances 
in the reverse genetics of crop plants. Furthermore, a mutant 
collection and TILLING-based high-throughput screening 
system is described. This paper concludes with the likely fu- 
ture perspectives for the molecular breeding of soybean. 
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Table 1. Comparison of typical metliods used for 


reverse genetic ! 


sUidics ofp 


lanls 






Method 


Resource 
type 


Mutation 
specificity 


Mutation 
stability 


Allelic 
variations 


Screening throughput 


Reference 


Gene silencing 


Recombinant 


Uncontrollable 


Unstable 


Uncontrollable 


Depends on the host* 


Schwab et at. (2006) 


Overexpression 


Recombinant 


Uncontrollable 


Unstable 


Uncontrollable 


Depends on the host"^ 


Ichikawa et al. (2006) 


T-DNA tagging 


Recombinant 


Random 


Stable 


Limited 


Depends on the host* 


Sessions et al. (2002) 


Transposon tagging 


Recombinant* 


Random 


Stable 


Limited 


Depends on the host* 


Tissier etal. (1999) 


Zinc-finger nuclease 


Recombinant 


Specific 


Stable 


Possible 


Low 


Lloyd et al. (2005) 


Homologous recombination 


Recombinant 


Specific 


Stable 


Possible 


Very low 


Shaked et al. (2005) 


Deletagene 


Non-recombinant 


Random 


Stable 


Limited 


High 


Li etal (2001) 


TILLING 


Non-recombinant 


Random 


Stable 


Available 


Medium 


McCallum et al (2000) 



* In the case of rice, a highly active endogenous transposon (TosJ7) is available for use in the transposon tagging system (Miyao et al. 2007). 
t Transformation efficiency of the host plant species is the bottleneck for these systems. 



Reverse genetic screening for different types of mu- 
tant resource 

The size and complexity of a mutant resource is the most 
important factor for successfully screening mutants by 
means of the reverse genetic approach. Since the features of 
a mutant resource depend mainly on the types and doses of 
mutagens used to create it, the choice of screening method is 
also important. For example, ionizing radiation (e.g., X-rays, 
gamma rays and fast neutrons) induces mainly nucleotide 
deletions of various sizes at a relatively low density 
(Cecchini et al. 1998, Shirley et al. 1992). In this case, the 
large deletions (>5 kb) may completely eliminate multiple 
fiinctional loci and their functions, whereas the small dele- 
tions (1 to 100 b) lead to partial loss of nucleotide sequences, 
and frequently induce frame-shift, amino acid-truncation, or 
mis-splicing mutations that lead to complete loss of function 
of the target genes (Anai et al 2005, 2008, 2012). 

In contrast, several popular chemical mutagens [e.g., 
ethylmethane sulfonate (EMS) and 7V-methyl-iV-nitrosourea 
(NMU)] induce mainly single-base substitutions at high 
density (Greene etal. 2003, Satoh etal. 2010). The base sub- 
stitutions induce not only effective mutations such as non- 
sense, mis-sense, and mis-splicing mutations, but also gene- 
silencing mutations. These chemical mutagens are attractive 
because they can take advantage of smaller mutant popula- 
tions than the other methods. However, the detection of mu- 
tants carrying a specific base substitution or small deletion is 
more difficult than for large deletions. 

Two potent screening methods, Deletagene and CGH, are 
appropriate choices for obtaining mutants with large dele- 
tions. The PCR-based Deletagene approach is a simple high- 
throughput screening method that uses highly pooled 
(-2500 lines/pool) genomic DNAs prepared from mutant 
lines treated with fast neutrons as templates (Li et al. 2001). 
In this method, it is necessary to position both primers out- 
side of the deleted region; however, it is difficult to accurate- 
ly predict the size of each deletion, so designing the primers 
is a rate-limiting step. The hybridization-based CGH ap- 
proach is a traditional dot-blot-like method, which uses a 
high-density microarray containing about 700 000 unique 



probes to screen a population irradiated with fast neutrons 
(Bolon et al. 2011). Because of the low mutation rate in 
these two types of lines, it is necessary to use a large mutant 
population in both methods. In addition, the cost of a CGH 
microarray is currently so expensive that it is difficult to use 
the microarrays to directly screen a specific mutant from a 
large mutant population. 

Currently, the TILLING approach is one of the most 
attractive screening methods for mutant-based reverse ge- 
netics. This method has several advantages over the other 
methods used to detect single-nucleotide polymorphisms be- 
cause of its high sensitivity and throughput. TILLING was 
originally reported as a method to detect mismatches in het- 
eroduplex DNA when denaturing high-performance liquid 
chromatography was used (McCallum et al. 2000). Subse- 
quently, it was modified to use a mismatch-specific nuclease 
(CEL I) and the LI-COR gel imaging system with fluores- 
cent dye-labeled primers to improve its throughput (Colbert 
et al. 2001). CEL I can efficiently cleave both the base sub- 
stitutions induced by chemical mutagens and the small dele- 
tions induced by ionizing radiation (Oleykowski et al. 
1998). For this reason, the mutant resources used with the 
TILLING method have been developed mainly by means of 
treatment with EMS, but some research groups have suc- 
cessfully isolated several mutants from populations treated 
with other mutagens (Table 2). These results indicate that 
the mutant resources obtained from various mutagen treat- 
ments could be widely adapted to the TILLING method. 

A TILLING platform for soybean 

The TILLING approach has three main advantages for crop 
species: (1) It can generate a wide range of genetic alter- 
ations in addition to loss-of- function alleles, including hypo- 
morphic, hypermorphic and neo-morphic mutations. (2) It 
can produce permanent and heritable mutations. (3) It can be 
adapted to polyploid species and functionally redundant 
genes. Because these benefits are particularly attractive for 
crop improvement efforts, reverse genetics projects that use 
the TILLING method are now under way for many crop spe- 
cies, including barley, maize, pea, rice, soybean, tomato and 



464 



Anai 



Tabid. ()\cr\ic\\ ol'lhc mulant resources used m ihe TILLINCi nietliod m \arious planl species 



Mutagen 



Mutation type 



Plant species 



Mutagen dose Mutation frequency 



Reference 



Chemical agents 



EMS 


Point mutations 


Afdbidopsis 


20-40 mM 


3.3/Mb 


Greene et al. (2003) 


EMS 


Point mutations 


Barley 


20-30 mM 


1.0/Mb 


Caldwell et al. (2004) 


EMS 


Point mntfltifiTi** 




24-48 mM 


7.6-24. 1/Mb 


Wane et al ("2008^ 


EMS 


Point mutations 


Bfdssicci fcipct 


24-32 mM 


16.7/Mb 


Stephenson a/. (2010) 


EMS 


Point mutations 


Maize 


5 mM* 


0.9-2. 1/Mb 


Weil and Morde (2007) 


EMS 


Point mutations 


Medicago 


12 mM 


2. 1/Mb 


Le Signer et al. (2009) 


EMS 


Point mutations 


Pea 


4mM 


1.5/Mb 


Triques et al. (2007) 


EMS 


Point mutations 


Peanut 


30-96 mM 


0.9-1. 1/Mb 


Knoll a/. (2011) 


EMS 


Point mutations 


Rice 


130 mM 


3.8/Mb 


TiW etal. (2007) 


EMS 


Point mutations 


Sorghum 


8-24 mM 


1.9/Mb 


Xin et al (2008) 


EMS 


Point mutations 


Soybean 


40-50 mM 


1.8-7. 1/Mb 


Cooper et al (2008) 


EMS 


Point mutations 


Soybean 


30-40 mM 


0.3-1. 3/Mb 


Present study 


EMS 


Point mutations 


Tomato 


56-80 mM 


1.7-3. 1/Mb 


Minoiae^a/. (2010) 


EMS 


Point mutations 


Wheat 


60-96 mM 


41.7/Mb 


Slade et al (2005) 


EMS 


Point mutations 


Wheat (durum) 


60-80 mM 


25.0/Mb 


Slade etal (2005) 


NMU 


Point mutations 


Rice 


1 mM* 


7.4/Mb 


Suzuki et al (2008) 


NMU 


Point mutations 


Soybean 


2.5 mM 


7. 1/Mb 


Cooper a/. (2008) 


NaNj 


Point mutations 


Barley 


lOmM 


2.6/Mb 


TaXsmsetal. (2008) 


Physical agents 












y-rays 


Deletions / point mutations 


Rice 


500 Gy 


0.2/Mb 


Sato a/. (2006) 


X-rays 


Deletions 


Soybean 


200 Gy 


0.1/Mb 


Present study 



EMS, ethylmethane sulfonate; NMU, A'-methyl-TV-nitrosourea 

* These mutant populations were developed from progeny pollinated with mutagen-treated pollen. 



wheat (Table 2). In these projects, several thousands or tens 
of thousands of mutant plants are necessary to isolate specif- 
ic mutants. 

In soybean, Cooper et al. (2008) first reported the devel- 
opment of a TILLING system. They developed three EMS- 
treated populations and an NMU-treated population, and es- 
timated mutation rates ranging from 1.8 to 7. 1/Mb. Dierking 
and Bilyeu (2009) successfully isolated novel mutant alleles 
for the raffinose synthase gene {RS2) and the omega-6 fatty 
acid desaturase gene (FAD2-1A) using this system. We have 
developed a soybean mutant population consisting of more 
than 40 000 lines, and have developed an improved 
TILLING method adapted for high-throughput screening 
(Fig. 1). This mutant population contains six sub-populations 
that have mutation rates ranging from 0. 1 to 1 .3/Mb that were 
generated through the use of X-rays, EMS, or a combination 
of the two (Fig. 2). In the TILLING protocol, the cleavage 
with a mismatch-specific nuclease and the gel electrophore- 
sis are the most important steps. Generally, increasing the 
mutation density is thought to be a good idea to reduce the 
size of the population, but a mutant containing too many mu- 
tations would be difficult to analyze in terms of its pheno- 
type and difficult to use as breeding material. For these 
reasons, the mutation rate in the TILLING population must 
be carefully considered so that it will be appropriate for the 
purpose of the mutants that are obtained. The activity of the 
mismatch-specific nuclease has been detected in many plant 
species (Oleykowski et al. 1998). For TILLING, the most 
conmionly used CEL I was obtained from celery, and Yang 



et al. (2000) reported that CEL I preferentially recognized 
particular mismatches. Triques et al. (2007) reported that 
recombinant ENDOl, which is a mismatch-specific 
Arabidopsis endonuclease, strongly recognized many types 
of mismatches and was useful for TILLING. In our system, 
we detect CEL I-digested double-strand DNA fragments 
that are pre-stained with a highly sensitive fluorescent dye 
and separated by means of agarose gel electrophoresis 
(Fig. 1). We could detect all types of mismatch with this ap- 
proach. The possible reason of this result may be the pres- 
ence of two different heteroduplexes originated from both 
strands of PCR product and/or the higher CEL I concentra- 
tion optimized for double-strand cleavage system. However, 
the frequency of false-positive cleavages may be increased 
by the substrate specificity of CEL I. 

We have also constructed a high-throughput soybean 
TILLING system that enables a single person to screen 
about 6000 independent mutant lines in a day. We have suc- 
cessfully isolated some novel alleles of genes that control 
maturity (Watanabe et al. 2009, 201 1) and of a fatty acid de- 
saturase gene (Hoshino et al. 2010) using this platform. 

Future prospects for reverse genetics in soybean 

During the past decade, large-scale genome sequencing 
projects have been completed in several major crop species, 
including soybean, and a huge volume of nucleotide se- 
quence data corresponding to individual genes is now avail- 
able from public databases. Thus, the development of robust 
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Fig. 1. Outline of the development of a soybean mutant population and the process of mutant screening employing the TILLING approach. 
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Fig. 2. Summary of the soybean mutant populations developed at Saga University. * FukuHOLL is an experimental line that has been described 
previously (Hoshino et al. 2010). 
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tools for linking the nucleotide sequence of a particular gene 
to the phenotype produced by its alleles has been desired by 
the research community. The mutant-based reverse genetic 
approach described in this paper, which can create novel al- 
leles associated with "perfect markers" (Varshney et al. 
2006) for a particular gene, has the potential to allow breed- 
ers to improve a range of crops without using transgenic 
technology. This approach can therefore overcome the diffi- 
culty of conventional breeding programs, such as the func- 
tional redundancy of polyploid genomes and the pyramiding 
of multiple alleles. This approach may shift the perception of 
genetic resources in agricultural studies, because it brings 
about not only loss-of-function mutants with various degrees 
of impairment but also provides gain-of-fimction mutants. In 
fact, a novel mutant virus-resistance allele of the eIF4E gene 
was obtained from EMS-mutagenized tomato populations 
using the TILLING approach (Piron et al. 2010). 

To efficiently generate gain-of-function alleles from a 
mutant population, it is necessary to drastically increase the 
throughput of TILLING. In recent years, new DNA se- 
quencing technologies that provide very high throughput at 
a relatively low price, the so-called "next-generation se- 
quencing" (NGS) technologies, have been progressively de- 
veloped, and the cost of sequencing massive amounts of 
DNA has rapidly decreased (Ansorge 2009). This progress 
of NGS will have important implications for achieving next- 
generation reverse genetics, because the most secure way to 
identify mutations is by comparison of nucleotide sequences 
between wild-type and mutant alleles. Most recently, Tsai et 
al. (2011) reported novel TILLING method using NGS, 
lUumina GA platform, but this protocol contains cumbersome 
process for separation of mutation signal from large back- 
ground noise. The current read length of NGS is much 
shorter than that of Sanger sequencing, so a longer read 
length (at least 400 to 500 bp/read) will be necessary to adapt 
NGS to efficient reverse genetic screening. However, Schadt 
et al. (2010) believe that the read-length problem will be 
overcome within a few years, after which mass-sequencing 
of multi-dimensionally pooled mutant DNAs will become a 
major tool for reverse genetic screening. The mutant re- 
sources of soybean will become increasingly important for 
both functional genomics research and developing breeding 
materials as these tools mature. 
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